System and method for distributed load balancing with load balancer clients for hosts

ABSTRACT

Embodiments may include a load balancer configured to, for a given packet received from a remote client, select a host computer of a plurality of host computers according to a load balancing protocol, the selection based on load information updates received from the plurality of host computers. Each load information update may indicate a measure of the respective host computer&#39;s capacity to service additional connections with remote clients. The load balancer may provide the given packet to the selected host computer for processing by an application component on the selected host computer, and subsequent to determining that a specific application component on one of the plurality of host computers does not have capacity for additional connections based on the load information updates, continue to send packets for existing connections serviced by the specific application component to the respective host computer without sending packets for new connections to that host computer.

This application is a divisional of U.S. application Ser. No.13/167,557, filed Jun. 23, 2011, now U.S. Pat. No. 9,055,076, which ishereby incorporated by reference in its entirety.

BACKGROUND

Large networks that service client requests often benefit from the useof load balancers. For instance, in a network that includes multipleback-end servers that process client requests, a front-end load balancermay attempt to evenly distribute incoming requests across the servers inorder to prevent any one server from becoming overloaded. While loadbalancers generally improve the overall efficiency of large distributednetworks with many servers, there are circumstances in which theshortcomings of certain load balancers may outweigh the performancegained from their load balancing features. For instance, certain systemsrequire large transmission control protocol (“TCP”) window sizes, whichcan be very demanding on a load balancer that has fixed memoryresources. In some cases, the load balancer may attempt to shrink windowsizes to compensate for inadequate memory resources; however, thistypically results in poor connection performance for clients asshrinking window sizes may cause a sharp increase in the ratio ofoverhead data to useful data for a given connection. Direct serverreturn (DSR) is a technique that can overcome some limitations of loadbalancers in certain implementations. However, DSR's reliance on MediaAccess Control (MAC) addresses for packet delivery may be unacceptablefor complex network topologies.

Additionally, load balancers typically perform decisions based on coarsecharacteristics of back-end servers. For instance, in some cases, loadbalancers may base decisions on information that specifies whether eachback-end server is “up” or “down” (e.g., either operating or notoperating, respectively). Furthermore, in systems with multiple loadbalancers, different groups of back-end servers are typically assignedto specific load balancers. To move a back-end server from thedistribution domain of one load balancer to another typically requiresterminating active connections on that back-end server as well asperforming configuration changes on the relevant load balancers.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a block diagram of an example system configurationincluding ingress hosts, egress hosts, server hosts, and remote clients,according to some embodiments.

FIG. 2 illustrates a flow diagram for handling request and responsepackets including the ability to retain the client's network address inthe request packet for evaluation by the server host as well as theability for the server host to serve as a connection endpoint (e.g., asopposed to a connection terminating on the ingress host), according tosome embodiments.

FIG. 3 illustrates a block diagram of an example remote client,according to some embodiments.

FIG. 4 illustrates a block diagram of an example ingress host, accordingto some embodiments.

FIG. 5 illustrates a block diagram of an example server host, accordingto some embodiments.

FIG. 6 illustrates a flow diagram of the load information reportingprocess between a server host and ingress host (e.g., load balancer),according to some embodiments.

FIGS. 7A-7D illustrates flow diagrams illustrating the manner in whichconnection volume fluctuates in response to different load updatesreported by server hosts, according to some embodiments.

FIG. 8A-8C illustrate flow diagrams that demonstrate a server host usingload updates to transition its source of connections from one ingresshost to another, according to some embodiments.

FIG. 9 illustrates a flow chart of an example method for processing arequest packet including tracking the packet's original destinationaddress through the use of one-to-one mappings between ports anddifferent network addresses, according to some embodiments.

FIG. 10 illustrates an example method performing a load balancingtechnique for allocating connections away from a particular server hostwithout closing existing connections for that host, according to someembodiments.

FIG. 11 illustrates an example method for transitioning the source ofconnections for a given host computer (e.g., a server host) from oneload balancer to another load balancer, according to some embodiments.

FIG. 12 illustrates one example of a computer system suitable forimplementing various elements of the system and method for distributedload balancing with load balancer clients for hosts, according to someembodiments.

While the system and method for distributed load balancing with loadbalancer clients for hosts is described herein by way of example forseveral embodiments and illustrative drawings, those skilled in the artwill recognize that the system and method for distributed load balancingwith load balancer clients for hosts is not limited to the embodimentsor drawings described. It should be understood, that the drawings anddetailed description thereto are not intended to limit the system andmethod for distributed load balancing with load balancer clients forhosts to the particular form disclosed, but on the contrary, theintention is to cover all modifications, equivalents and alternativesfalling within the spirit and scope of the system and method fordistributed load balancing with load balancer clients for hosts asdefined by the appended claims. The headings used herein are fororganizational purposes only and are not meant to be used to limit thescope of the description or the claims. As used throughout thisapplication, the word “may” is used in a permissive sense (i.e., meaninghaving the potential to), rather than the mandatory sense (i.e., meaningmust). Similarly, the words “include,” “including,” and “includes” meanincluding, but not limited to.

DETAILED DESCRIPTION OF EMBODIMENTS

Various embodiments of a system and method for distributed loadbalancing with load balancer clients for hosts are described. Asdescribed in more detail below, embodiments may include componentsconfigured to perform distributed load balancing techniques thatdistribute data packets to back-end host systems (e.g., web servers).For instance, a client computer may send request packets and a hostcomputer may send response packets to establish a packet connection(e.g., as part of a handshaking protocol); embodiments may utilize oneor more load balancers to distribute such connections across multipleback-end host computers according to a load balancing protocol. In manycases, the network(s) between the load balancers and the multipleback-end host computers may be relatively complex networks including anyof a variety of networking components including but not limited torouters (e.g., IP routers), switches, hubs, bridges and othercomponents. As such, embodiments may avoid reliance on routingtechniques that require knowledge of the physical network topology(e.g., conventional DSR) as the complexity and/or fluid (e.g.,frequently changing) nature of such topology renders such relianceineffective or impractical. Instead, embodiments described herein mayenable inbound packets to be routed to back-end hosts according one ormore network protocols (e.g., Internet Protocol (IP)). In these cases,when a packet traverses the load balanced system, the original sourceaddress (e.g., the client's network address) may be preserved as thesource address within the packet. In this way, a receiving system (e.g.,a back-end host) may perform any access control functions that rely onthe original source address. Furthermore, when a packet traverses theload balanced system, the destination address of the packet may beupdated to reflect the next destination within the load balanced system,whether that destination is a back-end system or some intermediatedestination between the entry point of the load balanced system and theback-end system. As such, the packet remains routable according to theaforesaid networking protocols.

In various embodiments, in addition to the original source address(e.g., the client's network address) and the address of the nextdestination within the network path traversed by a packet within theload balanced system, one or more components of the load balanced systemmay utilize the original destination network address to which a client'srequest packet was sent. For instance, a host server that sends aresponse packet may write that original destination address as thesource of the response packet. In this way, when a response packet issent to a client, the response packet may appear (from the client'sperspective) to originate from the network address to which the clientsent the original request packet. As an example, this original networkaddress may be a network address of the entry point into the loadbalanced system whereas the component that sends the response packet mayhave a different network address. Furthermore, some connection types(e.g., cryptographically protected connections) may prevent packets fromconveying the original destination address through conventionaltechniques (e.g., x-forwarded-for). As such, embodiments may provideback-end hosts with knowledge of a packet's original destination address(e.g., an external network address for the load balanced system) withoutactually specifying that address within information of the packet thatis delivered to that back-end host. As described in more detail below,the manner in which the original client's network address is conveyed tothe back-end host (without including that address in information of thepacket) may include sending packets to a specific port on the back-endhost. This specific port may have a one-to-one mapping or associationwith the original destination address. In this way, the back-end hostmay compare the port on which a packet is received to this one-to-onemapping in order to determine the original destination address. Asdescribed above, this original destination address may be written as thesource of any response packets sent to the client such that the responsepackets appear to (from the client's perspective) to originate from thenetwork address to which the client sent the original request packet.

In addition to preserving access control functionality for back-endhosts, embodiments may also shift connection endpoints from the loadbalancers to one or more of the back-end hosts. As described in moredetail below, cryptographically protected connections may be terminatedby load balancer clients instead of the actual load balancers. Loadbalancer clients may be implemented on the actual back-end hosts thatservice connections (or on other hosts in some cases). Whetherimplemented on the actual back-end host or another system, a back-endhost's load balancer client may be configured to terminate connectionsincluding but not limited to cryptographically protected connections. Inthis way, the load balancer clients may accommodate connections withlarge TCP window sizes without exerting undue stress on the hardwareresources of the load balancers. Furthermore, in many cases, thehardware on which the load balancer clients are implemented may be lessexpensive than the load balancers, thereby representing a cost savingsfor the entity that owns the load balanced system.

Embodiments may also be configured to perform connection level loadbalancing based on load updates from back-end hosts. While the actualload balancing protocol may be performed by one or more load balancers,a given back-end host may influence or partially control the quantity ofconnections that it receives by reporting specific types of load updatesto one or more of the load balancers. For instance, a back-end host islikely to receive more connections from a particular load balancer afterproviding that load balancer with a load update that specifies theback-end host is lightly loaded or otherwise has capacity to serviceadditional connections. Similarly, a back-end host is likely to receivefewer connections from a particular load balancer after providing thatload balancer with a load update that specifies the back-end host isheavily loaded or otherwise has little capacity to service additionalconnections. As described in more detail below, the back-end hosts maybe configured to concurrently report these types of load updates tomultiple load balancers. In this way, a back-end host may use loadupdates to control the maximum quantity of connections that it mayreceive from different load balancers. As a given load balancer may nothave knowledge of the connection load exerted on a host by other loadbalancers, the different load updates sent to different load balancersmay enable a given back-end host to service connections from thosebalancers without the risk that one of the load balancer swill overwhelmthe back-end host with too many connections. For example, the back-endhost may determine its total capacity to service connections and ensurethat the individual load updates sent to different load balancers do notcollectively indicate a total load that exceeds this capacity. Forinstance, if a back-end host determines that it has capacity to service400 additional connections, the back-end host may send to one loadbalancer a load update that indicates the host is able to accept 100additional connections and send to another load balancer a load updatethat indicates the host is willing to accept 300 additional connections.

FIG. 1 illustrates a block diagram of an example system configuration inwhich distributed load balancing may be implemented. As illustrated oneor more client systems (or “clients”) 100 a-e may be configured tocommunicate with one or more network components through networks 102a-b, which may be configured in a manner similar to that of network 1285of FIG. 12 described below. For example, any of the illustrated clientsmay represent a computer system that requests information or data fromserver hosts 114 a-d. One example of a client system is described inmore detail below with respect to FIG. 3. Generally, clients may seek toestablish connections through which to communicate with one or moreserver hosts in order to exchange information. As described in moredetail below, these connections may be cryptographically-protected (suchas SSL connections). In various embodiments, the illustrated serverhosts may provide information or data representing a marketplace ofgoods or services that may be purchased by users of the clients systems.For instance, a user may utilize a client to log on to an electronicmarketplace to purchase books or music. The server hosts may provideproduct detail pages that represent books, music, or other productsavailable for purchase. One example of the manner in which a connectionbetween a client and a server host may be established is described belowwith respect to FIG. 2.

In various embodiments, the system may include one or more ingress hosts104 a-b for processing inbound data from clients 100 a-e. The ingresshosts may perform be responsible for receiving incoming requests anddistributing these requests to server hosts 114 a-e according to a loadbalancing protocol. While the load balancing aspects of the ingresshosts are described in more detail below, the load balancing techniquesutilized by each ingress hosts generally seek to distribute work amongthe server hosts such that the hosts are evenly loaded with connectionsto remote clients. In some embodiments, different server hosts may havedifferent capabilities. For instance, one server host may be capable ofservicing 10 times the number of connections that can be serviced by aless capable server host. For instance, some hosts may include morehardware resources (e.g., processors, memory, etc.) than other hosts. Asdescribed in more detail below, each server host may periodically oraperiodically report information about its capacity to perform serviceadditional connections with remote clients to the ingress hosts. Theingress hosts may take this load information into consideration whendistributing connections among the server hosts.

For new and existing connections, ingress hosts may forward packets fromthese connections to the appropriate server hosts over one or morenetwork(s) 108, which may be configured in a manner similar to that ofnetwork 1285 of FIG. 12 described below. The load balancer (LB) clients112 a-d may receive such packets from the ingress hosts and pass therelevant information to servers 110 a-d. The load balancer client may beresponsible for decrypting encrypted packets and other duties asdescribed in more detail below with respect to FIG. 2. Furthermore,while each server host 114 a-d is illustrated as including a server 110a-d, the configuration need not be limited to this type of applicationcomponent. In various embodiments, servers and LB clients may beimplemented in software, hardware, or some combination thereof. Asillustrated, the load balancer clients may be implemented on the samehosts as the server application. However, in various embodiments, theload balancer clients may be implemented on different hosts. In somecases, each load balancer client may be dedicated to a respective one ofservers 110 a-d.

After receiving a packet (or information from a packet) from therespective LB client, the server component may process the informationand generate a response to the remote client. For instance, the responsemight be an acknowledgement (a transmission control protocol (“TCP”)ACK) to a connection setup request (e.g., a TCP SYN) submitted by one ofthe clients. In another example, a response might include data orinformation requested by the client, such as network-based content(e.g., product detail pages for the electronic marketplace mentionedabove). As described in more detail below, the generated response may besent direct from the server hosts or may be sent to one of egress hosts106 a-c, which may be responsible for forwarding response packets fromthe server hosts to the appropriate clients.

It should be noted that each server 110 a-d may accept traffic from anynumber of ingress hosts. In various embodiments, this system propertymay provide efficient utilization of the system as less server capacityneeds to be reserved in the event of a load balancer failure.Furthermore, the number of egress hosts may differ from the number ofingress hosts. For example, depending on the specific implementation,outbound traffic may be larger than inbound traffic on average overtime. In one non-limiting example, the volume of outbound traffic couldbe twice as much as that of inbound traffic. Accordingly, in variousembodiments, servers may be added in proportion to the amount of trafficflowing in each direction. As the limiting factor for traffic throughthe ingress and egress host may be packets per second in variousembodiments, splitting the roles in this manner may enable ingress hoststo focus on receiving packets such that a larger volume of inboundtraffic may be handled (relative to traffic capacity of ingress hostsabsent the assistance of egress hosts processing outbound traffic).

The illustrated system may also include a configuration data store 116,which may be utilized to store port-to-address mappings. For example,the ingress hosts may service multiple network addresses to whichclients may send requests. In various embodiments, multiple addressesmay route to the same ingress host (e.g., as specified by a Domain NameSystem (DNS). These addresses may be mapped to specific ports foringress hosts, server hosts and egress hosts. In this way, when a packetis passed to a component on a specific port within the illustratedsystem, that component may compare the port number to the mappings ofthe configuration data store to determine the associated networkaddress. This process is described in more detail below with respect toFIG. 2. In general, this mapping information may enable the ingresshosts, server hosts and egress hosts to determine a network addressassociated with a packet even though the packet itself may not includeinformation that specifies that address.

In the illustrated embodiment, the network boundary 118 may isolate theserver side network from networks 102 a-b. For instance, in addition tothe other functionality described herein, ingress and egress hosts mayperform network firewall duties to prevent unauthorized access to theserver side components. Accordingly, in various instances herein, theserver side components (e.g., ingress hosts, egress hosts, server hosts)may be referred to as collectively forming a particular network region.Generally speaking, this particular network region may provide a levelof security that is higher than that provide by outside networks (e.g.,the Internet). However, some embodiments may be implemented withoutrequiring this level of security.

In various embodiments, this particular network region may be accessibleto the remote client through a plurality of network addresses, such asmultiple virtual internet protocol (IP) address that route to differentones of the ingress hosts. These addresses may be refereed to herein aspublic addresses as they are the addresses that systems (e.g., remoteclients) outside of this particular network region For example, a clientmay send a request to a public address that is one of multiple publicaddresses that route to ingress host 104 a. In various embodiments,components of network(s) 102 a-b may be configured such that differentpublic address route to different ports on the respective ingress host.In this way, if the original destination address specified by the clientin a packet is overwritten en route to the ingress host, the ingresshost may determine which public address the client originally sent thepacket to by determining which public address is associated withwhatever port the ingress host received the packet on. This mapping ofpublic addresses to ports may be stored within configuration data store116 for each ingress host. Alternatively, each ingress host may storethis information locally. In various embodiments, each server host andeach egress host may also store port to public address mappings withinconfiguration data store 116. For a given component (e.g., server hostor egress host), this mapping information may specific one-to-onemappings between ports of that component and public addresses of theparticular network region. In this way, when that component receives apacket on a particular port, the component may determine which publicaddress the packet associated with even if the packet itself does notinclude information specifying that address. In various embodiments, theport mapping information (stored in configuration data store 116) forany component of the particular network region may be accessible to anyother component of the particular network region. In this way, when afirst component needs to convey the public address associated with apacket to a second component (without writing the public address withinthe packet), the first component may send the packet to the secondcomponent's port that is associated with that public address within themapping information. Use of the mapping information in this manner isdescribed in more detail with respect to FIG. 2 below.

FIG. 2 illustrates a data flow diagram that illustrates packet flowaccording to various embodiments. The illustrated system may utilizedistributed direct server return (DDSR) to move traffic around thenetwork while retaining the source network address on the packet (i.e.,the network address of the remote client that sent the packet).Utilizing DDSR enables a cryptographically-protected connection to beterminated at the server host (e.g., decrypted at the server host)instead of the load balancer (e.g., the ingress host) while alsoensuring that the server host has knowledge of the source networkaddress for enforcing access control policies.

In the illustrated embodiment, client 100 a may send a request packet200 to the example network address 172.1.1.1:80, which may be theexternal network address of the ingress host 104 a in the illustratedembodiment. In various embodiments, request packet 200 may be routed tothe ingress host according to a domain name system (DNS) and the openshortest path first (OSPF) routing protocol. Note that the externalnetwork address may be one of multiple different external addresses thatnetwork 102 a (e.g., the Internet) routes to the same ingress host 104a, which has an example internal network address of 10.1.1.1 in theillustrated embodiment (e.g., the address that components on the serverside of the illustrated network boundary use to address the ingresshost).

Ingress host 104 a may include a load balancer (e.g., load balancer 400described in more detail below). The ingress host may utilize the loadbalancer to select a host to which the request packet is to be sent. Invarious embodiments, this selection may be performed according to a loadbalancing protocol and may also be based on load information updatesfrom the server hosts, which is described in more detail with respect tosubsequent figures. In the illustrated example, the ingress host selectsserver host 114 a as the recipient server. To send the packet to theserver host, ingress host 104 a may rewrite the destination address ofthe request packet to be the destination address of the selected serverhost 114 a. In the illustrated embodiment, the address utilized is theaddress of LB client 112 a (10.2.2.2). Furthermore, ingress host 104 aalso selects a particular port to send the packet to. This port may beselected based on the port mapping information for server host 104 a.This mapping information may be specified by load update 206. Loadupdates are described in more detail with respect to FIG. 6 butgenerally may include information that specifies a specific port onwhich the server host is willing to receive connections and anindication of the quantity of additional connections that server host iswilling to accept. In some cases, this information may be stored in aconfiguration data store 116, although this is not required in variousembodiments. The ingress host may select the particular port that mapsto the original destination address (172.1.1.1:80) within this mappinginformation. In this way, the ingress host may convey the originaldestination address of the packet to the server host without theoriginal destination address being included in the packet (because thehost server can use its mapping information to determine the originaldestination address from the port number of the entry port). In variousembodiments, this technique may be utilized when handling packets forencrypted connections, in which case the ingress host may not be able toperform the requisite packet inspection (e.g., a layer 7 inspection)that would be needed to insert the original destination address withinthe packet. Similarly, the packet encryption may prevent alternativetechniques (e.g., Hypertext Transfer Protocol's x-forwarded-for) frombeing utilized. In the illustrated embodiment, the example port numberselected is port 4001.

After selecting the server host according to the load balancing protocoland selecting the specific port number according to the mappinginformation for the selected server host, the ingress host may send thepacket to the server host, as illustrated by request packet 200.Additionally, when the ingress host sends the request packet to serverhost 114 a, the ingress host may ensure that the packet's source headercontinues to indicate the client's network address as the sourceaddress. Furthermore, the destination header specifies an address of theserver host 10.2.2.2 (in this case the address of the LB client) as wellas the port number selected according to the techniques described above(port 4001). In the illustrated embodiment, the entire address isexpressed in socket format as 10.2.2.2:4001.

By utilizing the techniques described herein, the ingress host mayensure that the packet will arrive at the correct server host (and port)even if the packet is to traverse one or more routers (e.g., IP routers)on the way to the server host. For instance, conventional direct serverreturn's (DSR) reliance on Media Access Control (MAC) addresses wouldfail under this scenario. By utilizing the techniques described above,the network topology may be complex and even change over time; as longas the packet has the correct network address (e.g., IP address) of theserver host, the packet can be routed to that server host correctly.Furthermore, by utilizing the techniques described herein, when theingress host sends the request packet, the packet continues to specifythe client network address 45.1.1.1:56223 as the source. In this way,the server host may successfully evaluate any access control policiesthat depend on the client network address. For instance, some accesscontrol policies may specify white-lists or blacks lists of specificclient network address that are permitted or prohibited access toinformation (e.g., a data object) of the server host.

If the requisite access control policies completed successfully, the LBclient may pass information from the packet to be processed by server110 a. If the packet includes encrypted information (e.g., as is thecase for SSL connections), the LB client may decrypt encryptedinformation before providing it to server 110 a. In other words, whetherhandling encrypted or unencrypted packets, the LB client 112 a may serveas the endpoint between remote clients and server host 114 a.Furthermore, the LB client may pass any necessary information to server110 a through any of a variety of communication protocols. In onenon-limiting example, the LB client forwards information to the serverover HTTP to the address of the server component (127.0.0.1). The server(or other application component) may generate a response packet 202which the LB client may send to the egress host.

The LB client may send response packet 202 to the egress host in amanner similar to that utilized by the ingress host when sending therequest packet to the server host. For instance, the LB client mayensure that the source address of packet 202 is also the network addressof the client (in this case 45.1.1.1:56223). (In this way, egress host106 a will know to which address the response packet should be sent.)Furthermore, the LB client may ensure that the destination addressspecifies the network address of the egress host and the port of theegress host that maps to the original network address (172.1.1.1:80)that the client sent its request to. This port to network addressmapping may be specified by the mapping information in configurationdata store 116. In this way, when egress host receives the responsepacket, the egress host may rewrite the source address of the packet tobe equal to this original network address. In this way, when responsepacket 202 arrives at client 110 a, it will appear to the client as ifthe response packet was sent from the same address to which the clientsent request packet 200.

As described above, the LB client may serve as the server-side endpointfor any connection with remote clients. In this way, the system mayavoid scenarios in which the ingress hosts may become a trafficbottleneck, such as when processing large TCP windows of data. Byutilizing the techniques described herein to cause connections toterminate at the LB client, the capacity (e.g., memory, processingcycles, throughput, etc.) of the server host 114 a may bear the burdenof handling the data of the connection. In some cases, the ingress hostmay even leave the payload of incoming packets on the network interfacebuffer (of the hosts network interface card) and evaluate/manipulate theheaders only. These techniques may provide significant cost savings inscenarios where load balancer hardware (e.g., the ingress hosts) exceedsthe costs of server host hardware because the quantity of ingress hostsmay be minimized or reduced.

In various embodiments, the items and components described above mayinclude elements not illustrated in previous Figures. FIGS. 3-5 provideadditional details as to the possible components of the clients, ingresshosts and server hosts described above. FIG. 3 illustrates a blockdiagram of a client computer, according to some embodiments. In variousembodiments, client 100 a may be a computer, such as computer 1200 ofFIG. 12. Furthermore, while the illustrated client is client 100 a, anyof clients 100 b-e may be configured in a similar manner. In theillustrated embodiments, the client may include an operating system 300and browser 302 (e.g., a web browser or other network-based browser). Invarious embodiments, the client requests described herein may originatefrom browser 302. In other embodiments, applications 304 may reside onthe client. These applications may also initiate request and otherpackets to be sent to the server hosts.

FIG. 4 illustrates a block diagram of an ingress host, according to someembodiments. In various embodiments, ingress host 104 a may be acomputer, such as computer 1200 of FIG. 12. Furthermore, while theillustrated host is ingress host 104 a, any of the ingress hosts may beconfigured in a similar manner. In the illustrated embodiment, theingress may include a load balancer 400. The load balancer 400 maydistribute incoming packets to server hosts according to one or moreload balancing protocols (e.g., a round robin protocol). As described inmore detail below with respect to FIGS. 6-8 b, the load balancingprotocol utilized by load balancer 400 determine a particular servicehost to route a packet to based on load information updates receivedfrom the various service hosts. In various embodiments, throughreporting different levels of load over time, this may enable the serverhosts to exercise some level of control over the load balancing process(which is also described in more detail below with respect to FIGS. 6-8b). Generally speaking, in addition to the rules set forth by the chosenload balancing protocol, the load balancer may favor server hosts thathave a higher capacity to handle connections with remote clients (e.g.,lightly loaded server hosts) over hosts that have a relatively lowercapacity to handle connections with remote clients (e.g., heavily loadedserver hosts).

Ingress hosts may also include forwarding tables 402 for recordingconnections between remote clients and server hosts. In variousembodiments, tables 402 may specify different states for variousconnections including but not limited to an initial indicating that aconnection has been established, and a closed state indicating that aconnection is closed or in the process of being closed. In variousembodiments the forwarding table may be keyed using a connection or flowidentifier based on packet tuples. For instance, a tuple may include butis not limited to some combination of the client address and the serverside address (e.g., the external network address to which the client iscommunicating, such as 172.1.1.1:80 in FIG. 2) for the connection. Invarious embodiments, when the load balancer receives a packet for aconnection that has already been established and is active according tothe forwarding table, the load balancer may forward the packet to thehost server specified by the forwarding table without applying a loadbalancing protocol. In other words, in various embodiments, the loadbalancer may load balance connections by applying a load balancingprotocol to the initial packet(s) (e.g., handshake packets) associatedwith establishing a new connection. After a connection is established,the packets may be forwarded according to forwarding tables 402. Asdescribed above with respect to FIG. 2, by sending a packet over thecorrect port, the ingress host may convey the original destinationaddress of the packet (e.g., the address to which the remote client sentthe packet) to the recipient server host without that address actuallybeing included in the packet. In various embodiments, the informationthat specifies which port a given server host is to receive packets isstored in server hosts states 408, which may be based on the loadupdates reported by the server hosts (e.g., receiving ports andrespective capacity for each port).

Ingress hosts may also store server host states 408 locally. In variousembodiments, this information may represent a local cache of the loadinformation updates described in more detail below with respect to FIGS.6-8 b. Generally, this information may specify up-to-date (e.g., nearreal-time) load or capacity information for each of the server hosts ona per-port basis. In addition to the prescribed rules of the loadbalancing protocol, the load balancer may use this information fordetermining where to distribute new connections. For instance, in anexample where load information indicates different states (e.g., lightlyloaded, moderately loaded, heavily loaded, no more connectionsaccepted), the load balancer may utilize a round-robin load balancingprotocol in conjunction with the currents load states. For instance, theload balancer may round-robin new connections to each host in the lowestload category that still has hosts in it. In other cases, server hoststates 408 may specify different ports of the server hosts and, for eachport, a respective connection capacity that the respective host iscapable of accommodating. The server host states 408 may be updatedperiodically or aperiodically as the ingress server receives new loadupdates from load balancer clients, such as the load updates describedin more detail below with respect to FIGS. 6-8 b.

FIG. 5 illustrates a block diagram of server host, according to someembodiments. In various embodiments, server host 114 a may be acomputer, such as computer 1200 of FIG. 12. Furthermore, while theillustrated host is server host 114 a, any of the server hosts may beconfigured in a similar manner. In the illustrated embodiment, theserver host may include a LB client 112 a for accepting incomingconnection and performing any necessary decryption. As described above,the LB client may serve as the server-side endpoint for any connectionwith remote clients. In this way, the system may avoid scenarios inwhich the ingress hosts may become a traffic bottleneck, such as whenprocessing large TCP windows of data. The server host may also include aserver 110 a (e.g., a server application, process or other component)configured to process requests from the remote clients. In onenon-limiting example, server 110 a is a web server configured to respondto client requests with web page data. In another non-limiting example,server 110 a may serve content associated with an electronicmarketplace, such as product detail pages or transaction information.Any of this information may be provided in response to remote clientrequests, such as the request packets described above. Furthermore, theresponses provided by server 110 a may be returned as one or moreresponse packets to the respective remote client.

In various embodiments, the server host may also include inbound flowtable(s) 504. In various embodiments, these tables may map inboundconnections or packet flows (e.g., as identified by packet tuples) tospecific public addresses serviced by the ingress host. In this way,when a packet is received on a specific port (e.g., request packet 200),the server host may determine what public address the client sent thatpacket to (e.g., 172.1.1.1:80 in FIG. 2). The server host may alsoinclude outbound flow tables 506, which may map connections or packetflows (e.g., as identified by packet tuples) to different ports of theegress hosts. This table may in some cases serve as a local cache ofinformation from configuration data store 116. As described above withrespect to FIG. 2, by sending a packet over the correct port, the serverhost may convey the original destination address of the packet (e.g.,the address to which the remote client sent the packet) to the egresshost without that address actually being included in the packet.Furthermore, by utilizing tables 502 and 504 keyed to connection or flowidentifiers (e.g., based on packet tuples), embodiments may ensure thatoutbound packets of a particular connection are consistently routedthrough the same egress server thereby preserving packet ordering on theclient side in many cases.

Furthermore, as described above, the server hosts may utilize theclient's network address to implement access control policies. Forinstance, server 110 a may validate the client's network address againstan access control policy prior to giving that client access to certaininformation, such as customer account information or protected dataobjects. For instance, if the client's network address is not permittedper the access control policy, requests from that client may berejected. To ensure that this access control evaluation may be performedby server 110 a, load balancer client 112 a may provide the client'snetwork address to server 110 a through any communication techniqueincluding but not limited to HTTP's x-forwarded-for. Note that invarious embodiments the load balancer client 112 a may utilizex-forwarded-for to convey the client's network address as communicationbetween the load balancer client and the server component need not becryptographically protected even in cases where the connection betweenthe external client and the load balancer are cryptographicallyprotected.

As described above, the server hosts may be configured to report loadinformation updates to the ingress hosts. Additionally, the ingresshosts may be configured to utilize this information to throttle thequantity of connections that are given to the individual server hosts.FIG. 6 illustrates a flow diagram illustrating load reporting and loaddistribution, according to some embodiments. In the illustratedembodiment, server host 114 a may provide one or more load informationupdates (or simply “load updates”) 600 to ingress host 104 a. In variousembodiments, load updates may be provided from server hosts to ingresshosts on a periodic or aperiodic basis. Generally, the load updates mayspecify a server host's capacity to service additional connections withremote clients. For instance, in various embodiments, a given loadupdate 600 may specify a quantity of additional connections that theserer host is capable of handling. In some cases, this may be a quantitydetermined by the server host itself. In other cases, the loadinformation update may specify information that the ingress host may useto infer the server hosts capacity to accept additional connections. Forinstance, a given load update may specify a current number of activeconnections on the server host, current bandwidth utilization of theserver host, current processor utilization of the server host, currentmemory utilization of the server host, or some other performance metricassociated with the server host. In yet other cases, load update 600 mayspecify a load level from a predefined scale of multiple load levels.For instance, such a scale may include a range of load levels fromlightly loaded, to moderately loaded, to heavily loaded, to a level inwhich no more connections are accepted by the server host. In variousembodiments, load updates from a server host may be port specific. Forinstance, the load updates may specify different quantities ofconnections that the server host is willing to accept on different portnumbers and these port numbers may vary across different load balancers.For example, as described above, if a server host determines that it hascapacity to service 400 additional connections, the server host may sendto one load balancer a load update that indicates the host is able toaccept 100 additional connections and send to another load balancer aload update that indicates the host is willing to accept 300 additionalconnections. In some cases, a server host may provide different loadupdates for different ports to the same load balancer. For instance, forthe same load balancer, the server host may use load updates to specifycapacity for unprotected connections on one port and capacity forcryptographically protected connections (e.g., SSL connections) onanother port.

In various embodiments, the configuration of FIG. 6 may differ fromconventional load balancer polling techniques in a number of respects.For instance, instead of the load balancer polling the back-end hosts todetermine whether the hosts are up or down, the load balancer clientsmay proactively report the load updates described herein to the loadbalancers of the ingress hosts. Furthermore, instead of simplyindicating whether a server is up or down, the load updates may givesome indication as to the actual capacity the server host has to serviceadditional connections. For example, as described above, the loadbalancer client may specify the different capacities for differentports.

As described above, the ingress host may throttle the amount ofconnections to the server host based on the load updates. In variousembodiments, this throttle may be a result of the manner in which theload balancer of the ingress host incorporates the load updates into theload balancer protocol for selecting a server host. Generally, serverhosts with more capacity to handle additional connections may beassigned more connections that server hosts with less capacity to handleadditional connections. In some embodiments, the capacity of hosts maychange over time as connections are closed or opened. This variabilitymay be reflected in the load updates that the server hosts provide tothe ingress hosts.

FIGS. 7A-7D illustrate flow diagrams demonstrating how the quantity ofconnections sent to a given server host varies over time as that serverhost provides updated load information to the ingress hosts. In FIG. 7A,server host 114 a provides ingress host 104 a with a load update thatindicates the server host is lightly loaded with existing connection(s)701 a. As the ingress host 104 a takes the load update intoconsideration when performing the load balancing protocol, the ingresshost 104 a may assign the server host with a relatively large quantityof new connections 702 a. In FIG. 7B, server host 114 a provides ingresshost 104 a with a load update that indicates the server host is nowmoderately loaded with existing connections 701 b. As the ingress host104 a takes the load update into consideration when performing the loadbalancing protocol, the ingress host 104 a may assign the server hostfewer connections, which will result in server host 114 a being loadedwith fewer new connections 702 b as at least some of the connectionsdecay without being replaced by new connections. In FIG. 7C, server host114 a provides ingress host 104 a with a load update that indicates theserver host is now heavily loaded with existing connections 701 c. Asthe ingress host 104 a takes the load update into consideration whenperforming the load balancing protocol, the ingress host 104 a mayassign the server host even fewer connections, which will result inserver host 114 a being loaded with even fewer new connection(s) 702 cas many connections decay without being replaced by new connections. InFIG. 7D, server host 114 a provides ingress host 104 a with a loadupdate that indicates the server host is not currently accepting newconnections (e.g., because the host is too heavily loaded with existingconnections 701 d or otherwise does not want additional connections). Asthe ingress host 104 a takes the load update into consideration whenperforming the load balancing protocol, the ingress host 104 a may ceaseto assign the server host with new connections, which will result inserver host 114 a being loaded with no connections from ingress host 104a as connections 702 d completely decay.

In various embodiments, each ingress sever may also be configured toautomatically cease assigning new connections to a specific server hostif the ingress host has not received a load update from that server hostwithin a certain threshold period of time. For example, the server hostsmay be configured to send load updates as period heartbeat messages. Ifan ingress server determines that a specific server host has not sent aheartbeat within a specified period of time (e.g., a period of time setby an administrator or a period of time dynamically determined by theingress host itself), the specific ingress server may cease assigningnew connections to that server host. Note that ingress server maycontinue to forward packets for existing connections to the server host.However, as those existing connections die out over time, the ingressserver will have no active connections routed to the host server (unlessthe host server sends another heartbeat message that indicates it ishealth enough to accept new connections).

As demonstrated by FIGS. 7A-7D, while the load balancer of the ingresshost may perform load balancing according to a load balancing protocolin order to assign new connections to different server hosts, eachserver host can largely control the load (e.g., quantity of connections)exerted on that host by reporting specific load updates to the ingresshost. For instance, to throttle its own load down, the server host mayprovide (to a specific ingress host) a load update that indicates it isheavily loaded or that it is no longer accepting new connections. Invarious embodiments, a server host may utilize this technique to controlthe quantity of connections it receives from each individual ingresshost in various embodiments.

FIGS. 8A-8C illustrate flow diagrams demonstrating how a server host maytransition its source of connections from one load balancer to anotherin various embodiments. In FIG. 8A, server host 114 a has used a loadupdate 800 a to indicate to ingress host that the server is lightlyloaded with existing connection(s) 801 a. As the server host hascapacity to spare, the ingress host assigns new connections 802 a to theserver host in accordance with the load balancing protocol describedherein. Also note that, in FIG. 8A, the server host 114 a does notprovide load updates to ingress host 104 b and ingress host 104 b doesnot assign new connections to that server host.

In FIGS. 8B-8C, server host 114 a may transition the source of itsconnections from ingress host 104 a to ingress host 104 b. In variousembodiments, this type of configuration change may be performed withoutquiescing server host 114 a (e.g., without killing the activeconnections handled by server host 114 a). In other words, the serverhosts are not locked into a particular ingress host (and can evenconcurrently accept connections from multiple ingress hosts). In theillustrated embodiment, server host 114 a provides a load update 800 bto ingress host 104 a to indicate that the server host is no longeraccepting new connections from that ingress host. As such, the quantityof connections 802 b will decay until no connection is routed throughingress host 104 a to server host 114 a. Server host 114 a may alsoprovide a load update 804 b to ingress host 104 b that indicates theserver host is moderately loaded (or otherwise has some capacity toreceive connections from that ingress host). Depending on the currentload of the other server hosts, ingress host 104 b may begin to assignat least some connections 806 c to server host 114 a, as illustrated inFIG. 8C. Note that connections 802 b were closed by either server host114 a or the respective remote client; the connections were notprematurely killed by ingress host 104 a. As the capacity of the serverhost 114 a has improved, ingress server 104 b may assign additionalconnections 806 c to the server host. As demonstrated by the operationsof FIGS. 8A-8C, server host 114 a may use load updates to successfullychange the source of connections from one ingress host to another. Invarious embodiments, similar techniques may be utilized to control theflow of incoming connections in other ways. For instance, the serverhost might provide load updates to multiple ingress hosts in order toconcurrently receive new connections from multiple ingress hosts.

The system and method for distributed load balancing with load balancerclients for hosts may include various methods, examples of which aredescribed in more detail below. In various embodiments, the illustratedmethods may be performed by one or more computer systems, such ascomputer 1200 of FIG. 12 described below. FIG. 9 illustrates an examplemethod for processing a request packet including tracking the packet'soriginal destination address through the use of one-to-one mappingsbetween ports and different network addresses serviced by the systemdescribed herein. As illustrated by block 900, the method may includereceiving a request packet into a particular network region thatincludes multiple host computers. The request packet may be sent from aremote client to a first network address of multiple network addressesthat route to the particular network region. For instance, one exampleof this type of request packet includes request packet 200. In thisspecific example, the first network address may be 172.1.1.1:80, whichis the address to which the client sent the request packet. Asillustrated by block 902, the method may include selecting according toa load balancing protocol, one of the host computers for processing therequest packet. For instance, this portion of the method may includeapplying any of the load balancing protocols described above. In one-nonlimiting example, a round-robin load balancing protocol may be utilizedin conjunction with load updates from various host computers in order toselect a specific host computer that is to receive the request packet.

As illustrated at 904, the method may also include sending the requestpacket to the selected host computer on a specific port of the selectedhost computer. In various embodiments, this specific port may have aone-to-one association with the first network address and thisassociation may be specified by mapping information that indicatesone-to-one relationships between communication ports of the selectedhost computer and the multiple network addresses that route to theparticular network region. For instance, examples of this mappinginformation are described as being stored within configuration datastore 116 in various embodiments. An example of sending a request packetto this type of port is illustrated by the transfer of packet 200 fromingress server 104 a to LB client 112 a (FIG. 2). In this example, thehost computer address is 10.2.2.2 and the specific port number is 4001.

As illustrated at block 906, the method may include receiving therequest packet on the selected host computer. Note that in variousembodiments the information of the request packet does not specify thefirst network address to which the request packet was sent by the remoteclient. For instance, as described above the first network address(e.g., the address to which the client sent the packet) may bedetermined by the port on which the packet is received and the mappinginformation from the configuration data store.

As illustrated at block 908, the method may include, subsequent todetermining the first network address by comparing the port on which theselected host computer receives the request packet (e.g., port 4001 inFIG. 2) to the mapping information, sending a response packet to theremote client, the response packet indicating the first network addressas the source of the response packet. For instance, in FIG. 2, thesource address of the response packet 202 is indicated as 172.1.1.1:80(the same address to which the client originally sent the correspondingrequest packet) even though the egress host (address 10.3.3.3) actuallysent the response packet. In this way, from the clients perspective, theresponse packet appears to originate from the same address to which theoriginal request packet was sent by the client.

FIG. 10 illustrates an example method performing a load balancingtechnique for allocating connections away from a particular server hostwithout closing existing connections for that host. As illustrated atblock 1000, the method may include, for a given packet received from aremote client, selecting a host computer of a plurality of hostcomputers according to a load balancing protocol. In general, thisportion of the method may include utilizing any of the load balancingprotocols described above in regard to selecting server host. In variousembodiments, the selection is also based on load information updates(e.g., load updates 600, 702 a-d, 802 a-b, 806 b-c) received from thehost computers (e.g., from the LB clients of the server hosts describedherein). In various embodiments, each load information update mayindicate a measure of the respective host computer's capacity to serviceadditional connections with remote clients. For example, as describedabove, load information updates may specify a quantity or number ofconnections a host computer is willing to take on. In another example,load updates may specify whether a host computer (e.g., a server host114) is lightly, moderately, or heavily loaded or not accepting anyconnections. In other cases, the load updates may specify metrics forinferring the host computers capacity to accept additional connections.For instance, a given load update may specify a current number of activeconnections on the host, current bandwidth utilization of the host,current processor utilization of the host, current memory utilization ofthe host, or some other performance metric associated with the host. Asillustrated at block 1002, the method may include providing the givenpacket to the selected host computer for processing by an applicationcomponent on the selected host computer. For instance, an example ofdelivering the given packet may include providing the request packetsdescribed above to specific servers hosts for processing by one ofservers 110 a-d.

As illustrated at block 1004, the method may also include, subsequent todetermining that a specific application component on one of the hostcomputers does not have capacity for additional connections based on theload information updates, continue to send packets for existingconnections serviced by the specific application component to therespective host computer without sending packets for new connections tothat host computer. For instance, as described above, when an ingressserver determines that a server host does not have capacity for newconnections, the ingress server may continue sending packets forexisting connection to that server host while ceasing assignment of newconnections to that server host.

FIG. 11 illustrates an example method for transitioning the source ofconnections for a given host computer (e.g., a server host) from oneload balancer to another load balancer. In various embodiments, thismethod may be performed by an individual server host. For instance, theLB clients described herein may be configured to perform the illustratedmethod in order to transition from one load balancer to another.

As illustrated at block 1100, the method may be performed by a givenhost computer. Furthermore, the method may include receiving from afirst load balancer, packets for one or more active connections assignedto the given host computer by the first load balancer. For instance,this portion of the method may include receiving packets that a firstload balancer forwards in accordance with a forwarding table, asdescribed above. As described above, the load balancer (e.g., loadbalancer 400) may distribute connections to multiple host computersincluding the given host computer based on a load balancing protocol andload information updates received from those host computers. In variousembodiments, this property of the load balancer may be leverage toenable a server host to control the quantity of connections that itreceives from a particular load balancer. In fact, as noted below withrespect to block 1102, techniques may be employed to transition thesource of connection from one load balancer to another (e.g., from oneingress host to another).

As illustrated at block 1102, the method may include transitioning thesource of connections from the first load balancer to a second loadbalancer, by i) sending to the first load balancer a load informationupdate that indicates the given host does not have capacity to receiveadditional connections from the first load balancer, and ii) sending tothe second load balancer a load information update that indicates thatthe given host has capacity to receive connections from the second loadbalancer. For instance, one example of transitioning the source ofconnections in this manner is illustrated by the transition from ingressserver 104 a to ingress server 104 b in FIGS. 8A-8B.

Example Computer System

Various embodiments of the system and method for distributed loadbalancing with load balancer clients for hosts, as described herein, maybe executed on one or more computer systems, which may interact withvarious other devices. Note that any component, action, or functionalitydescribed above with respect to FIGS. 1-11 may be implemented on one ormore computers configured as computer system 1200 of FIG. 12, accordingto various embodiments. For instance, in one embodiment, a computersimilar to computer system 1200 may be utilized to implement any of theclients, ingress hosts, egress hosts, or server hosts (and any componentor elements thereof) described above. In the illustrated embodiment,computer system 1200 includes one or more processors 1210 coupled to asystem memory 1220 via an input/output (I/O) interface 1230. Computersystem 1200 further includes a network interface 1240 coupled to I/Ointerface 1230, and one or more input/output devices 1250, such ascursor control device 1260, keyboard 1270, and display(s) 1280. In somecases, it is contemplated that embodiments may be implemented using asingle instance of computer system 1200, while in other embodimentsmultiple such systems, or multiple nodes making up computer system 1200,may be configured to host different portions or instances ofembodiments. For example, in one embodiment some elements may beimplemented via one or more nodes of computer system 1200 that aredistinct from those nodes implementing other elements.

In various embodiments, computer system 1200 may be a uniprocessorsystem including one processor 1210, or a multiprocessor systemincluding several processors 1210 (e.g., two, four, eight, or anothersuitable number). Processors 1210 may be any suitable processor capableof executing instructions. For example, in various embodimentsprocessors 1210 may be general-purpose or embedded processorsimplementing any of a variety of instruction set architectures (ISAs),such as the x86, PowerPC, SPARC, or MIPS ISAs, or any other suitableISA. In multiprocessor systems, each of processors 1210 may commonly,but not necessarily, implement the same ISA.

System memory 1220 may be configured to store program instructions 1222and/or data 1232 accessible by processor 1210. In various embodiments,system memory 1220 may be implemented using any suitable memorytechnology, such as static random access memory (SRAM), synchronousdynamic RAM (SDRAM), nonvolatile/Flash-type memory, or any other type ofmemory. In the illustrated embodiment, program instructions 1222 may beconfigured to implement any functionality of the clients, ingress hosts,egress hosts, or server hosts (and any component or elements thereof)described above including but not limited to the load balancers and LBclients. Additionally, data 1232 of memory 1220 may store any of theinformation or data structures described above, including but notlimited to packets, mapping information, forwarding tables, server hoststates, and load updates. In some embodiments, program instructionsand/or data may be received, sent or stored upon different types ofcomputer-accessible media or on similar media separate from systemmemory 1220 or computer system 1200. While computer system 1200 isdescribed as implementing the functionality of functional blocks ofprevious Figures, any of the functionality described herein may beimplemented via such a computer system.

In one embodiment, I/O interface 1230 may be configured to coordinateI/O traffic between processor 1210, system memory 1220, and anyperipheral devices in the device, including network interface 1240 orother peripheral interfaces, such as input/output devices 1250. In someembodiments, I/O interface 1230 may perform any necessary protocol,timing or other data transformations to convert data signals from onecomponent (e.g., system memory 1220) into a format suitable for use byanother component (e.g., processor 1210). In some embodiments, I/Ointerface 1230 may include support for devices attached through varioustypes of peripheral buses, such as a variant of the Peripheral ComponentInterconnect (PCI) bus standard or the Universal Serial Bus (USB)standard, for example. In some embodiments, the function of I/Ointerface 1230 may be split into two or more separate components, suchas a north bridge and a south bridge, for example. Also, in someembodiments some or all of the functionality of I/O interface 1230, suchas an interface to system memory 1220, may be incorporated directly intoprocessor 1210.

Network interface 1240 may be configured to allow data to be exchangedbetween computer system 1200 and other devices attached to a network1285 (e.g., any element of FIG. 1-11) or between nodes of computersystem 1200. Network 1285 may in various embodiments include one or morenetworks including but not limited to Local

Area Networks (LANs) (e.g., an Ethernet or corporate network), Wide AreaNetworks (WANs) (e.g., the Internet), wireless data networks, some otherelectronic data network, or some combination thereof. In someembodiments, the network(s) 102 of FIG. 1 and FIG. 2 may include one ormore networks configured in a manner similar to that of network 1285. Invarious embodiments, network interface 1240 may support communicationvia wired or wireless general data networks, such as any suitable typeof Ethernet network, for example; via telecommunications/telephonynetworks such as analog voice networks or digital fiber communicationsnetworks; via storage area networks such as Fibre Channel SANs, or viaany other suitable type of network and/or protocol.

Input/output devices 1250 may, in some embodiments, include one or moredisplay terminals, keyboards, keypads, touchpads, scanning devices,voice or optical recognition devices, or any other devices suitable forentering or accessing data by one or more computer systems 1200.Multiple input/output devices 1250 may be present in computer system1200 or may be distributed on various nodes of computer system 1200. Insome embodiments, similar input/output devices may be separate fromcomputer system 1200 and may interact with one or more nodes of computersystem 1200 through a wired or wireless connection, such as over networkinterface 1240.

As shown in FIG. 12, memory 1220 may include program instructions 1222,which may be processor-executable to implement any element or actiondescribed above. In one embodiment, the program instructions mayimplement the methods described above, such as the methods illustratedby FIGS. 9-11. In other embodiments, different elements and data may beincluded. Note that data 1232 may include any data or informationdescribed above.

Those skilled in the art will appreciate that computer system 1200 ismerely illustrative and is not intended to limit the scope ofembodiments. In particular, the computer system and devices may includeany combination of hardware or software that can perform the indicatedfunctions, including computers, network devices, Internet appliances,PDAs, wireless phones, pagers, etc. Computer system 1200 may also beconnected to other devices that are not illustrated, or instead mayoperate as a stand-alone system. In addition, the functionality providedby the illustrated components may in some embodiments be combined infewer components or distributed in additional components. Similarly, insome embodiments, the functionality of some of the illustratedcomponents may not be provided and/or other additional functionality maybe available.

Those skilled in the art will also appreciate that, while various itemsare illustrated as being stored in memory or on storage while beingused, these items or portions of them may be transferred between memoryand other storage devices for purposes of memory management and dataintegrity. Alternatively, in other embodiments some or all of thesoftware components may execute in memory on another device andcommunicate with the illustrated computer system via inter-computercommunication. Some or all of the system components or data structuresmay also be stored (e.g., as instructions or structured data) on acomputer-accessible medium or a portable article to be read by anappropriate drive, various examples of which are described above. Insome embodiments, instructions stored on a computer-accessible mediumseparate from computer system 1200 may be transmitted to computer system1200 via transmission media or signals such as electrical,electromagnetic, or digital signals, conveyed via a communication mediumsuch as a network and/or a wireless link.

Various embodiments may further include receiving, sending or storinginstructions and/or data implemented in accordance with the foregoingdescription upon a computer-accessible medium. Generally speaking, acomputer-accessible medium may include a computer-readable storagemedium or memory medium such as magnetic or optical media, e.g., disk orDVD/CD-ROM, volatile or non-volatile media such as RAM (e.g. SDRAM, DDR,RDRAM, SRAM, etc.), ROM, etc. In some embodiments, a computer-accessiblemedium may include transmission media or signals such as electrical,electromagnetic, or digital signals, conveyed via a communication mediumsuch as network and/or a wireless link.

The methods described herein may be implemented in software, hardware,or a combination thereof, in different embodiments. In addition, theorder of the blocks of the methods may be changed, and various elementsmay be added, reordered, combined, omitted, modified, etc. Variousmodifications and changes may be made as would be obvious to a personskilled in the art having the benefit of this disclosure. The variousembodiments described herein are meant to be illustrative and notlimiting. Many variations, modifications, additions, and improvementsare possible. Accordingly, plural instances may be provided forcomponents described herein as a single instance. Boundaries betweenvarious components, operations and data stores are somewhat arbitrary,and particular operations are illustrated in the context of specificillustrative configurations. Other allocations of functionality areenvisioned and may fall within the scope of claims that follow. Finally,structures and functionality presented as discrete components in theexemplary configurations may be implemented as a combined structure orcomponent. These and other variations, modifications, additions, andimprovements may fall within the scope of embodiments as defined in theclaims that follow.

What is claimed is:
 1. A non-transitory computer-readable storagemedium, storing program instructions computer-executable on a given hostcomputer to implement a load balancer client configured to: receive froma first load balancer, packets for one or more active connectionsassigned to the given host computer by the first load balancer; whereinthe load balancer distributes connections to a plurality of hostcomputers including the given host computer based on a load balancingprotocol and load information updates received from those hostcomputers; transition the source of connections for the given hostcomputer from the first load balancer to a second load balancer, whereinto perform the transition the load balancer client is configured to:send a load information update to the first load balancer, wherein thatload information update specifies that the given host does not havecapacity to receive additional connections from the first load balancer;and send a load information update to the second load balancer, whereinthat load information update indicates that the given host has capacityto receive connections from the second load balancer.
 2. Thenon-transitory computer-readable storage medium of claim 1, wherein theload balancer client on the given host computer is configured to:receive connections with remote clients through multiple load balancers,and transition the given host computer to an offline state by sending toeach those load balancers, a load information update that specifies thatthe given host does not have capacity to receive additional connections.3. The non-transitory computer-readable storage medium of claim 1,wherein the packets for one or more active connections compriseencrypted information, wherein the load balancer client is configured todecrypt the encrypted information and provide the decrypted informationto an application component that resides on the given host computer. 4.The non-transitory computer-readable storage medium of claim 1, whereinthe load balancer client is further configured, prior to sending theload information update to the second load balancer, to generate theload information update for the second load balancer.
 5. A methodperformed on a given host computer, the method comprising: receivingfrom a first load balancer and a second load balancer, packets for oneor more active connections assigned to the given host computer by thefirst load balancer and the second load balancer; wherein each loadbalancer distributes connections to a plurality of host computersincluding the given host computer based on a load balancing protocol andload information updates received from those host computers;individually controlling a quantity of additional connections from thefirst load balancer and the second load balancer, wherein saidcontrolling comprises: sending a load information update to the firstload balancer, wherein that load information update specifies the givenhost computer's capacity to receive additional connections from thefirst load balancer; and sending a load information update to the secondload balancer, wherein that load information update indicates the givenhost computer's capacity to receive additional connections from thesecond load balancer.
 6. The computer-implemented method of claim 5,wherein the packets for one or more active connections compriseencrypted information, wherein the method comprises decrypting theencrypted information and providing the decrypted information to anapplication component that resides on the given host computer.
 7. Thecomputer-implemented method of claim 5, wherein each load informationupdate is specific to a different port on the given host computer. 8.The method of claim 5, wherein the load information update to the firstload balancer indicates a quantity of additional connections the givenhost computer will accept from the first load balancer on a particularport of the given host computer.
 9. The method of claim 5, wherein theload information update to the first load balancer indicates a number ofactive connections at the given host computer, an aggregate bandwidth atthe given host computer, a processor utilization at the given hostcomputer, another performance metric of the given host computer, or anycombination thereof.
 10. The method of claim 8, wherein the loadinformation update to the first load balancer indicates a differentquantity of additional connections the given host computer will acceptfrom the first load balancer on a different particular port of the givenhost computer.
 11. The method of claim 8, wherein the load informationupdate to the second load balancer indicates a different quantity ofadditional connections the given host computer will accept from thesecond load balancer on the particular port of the given host computer.12. The non-transitory computer-readable storage medium of claim 8,wherein, to generate the load information update for the second loadbalancer, the load balancer client is configured to indicate that thegiven host computer is lightly loaded, moderately loaded, or heavilyloaded.
 13. The non-transitory computer-readable storage medium of claim8, wherein, to generate the load information update for the second loadbalancer, the load balancer client is not configured to indicate anumber of additional connections from the second load balancer that thegiven host computer will accept.
 14. A system, comprising: at least oneprocessor; and a memory coupled to the at least one processor, whereinthe memory stores instructions, and wherein the instructions, whenexecuted by the at least one processor, cause the at least one processorto implement a server host configured to: receive, from one or more of aplurality of load balancers, an assignment of one or more connections toone or more respective host computers, wherein the one or moreconnections provide one or more packets from the one or more respectivehost computers; and individually control a quantity of additionalconnections from a particular load balancer and a different loadbalancer of the plurality of load balancers, comprising: sending aparticular load information update to the particular load balancer,wherein the particular load information update indicates a capacity toreceive at least a particular number of additional connections from theparticular load balancer; and sending a different load informationupdate to the different load balancer, wherein the different loadinformation update indicates a capacity to receive at least a differentnumber of additional connections from the different load balancer. 15.The system of claim 14, wherein, to individually control the quantity ofadditional connections, the server host is further configured todetermine the particular load information update based at least in parton a number of active connections at the server host, a currentprocessor utilization of the server host, a current memory utilizationof the server host, another performance metric of the server host, orany combination thereof.
 16. The system of claim 14, wherein, to sendthe particular load information update, the server host is configured toindicate a load level of the server host from a predefined scale ofmultiple load levels.
 17. The system of claim 14, wherein the serverhost is configured to indicate, via the particular number, a capacity toreceive at least one additional connection, and to indicate, via thedifferent number, no capacity to receive additional connections.
 18. Thesystem of claim 14, wherein the server host is configured to send theparticular load information update and the different load informationupdate proactively.
 19. The system of claim 14, wherein the server hostis configured to indicate, to the particular load balancer via theparticular load information update, that the server host is online. 20.The system of claim 16, wherein the predefined scale includesindications that the server host is lightly loaded, moderately loaded,heavily loaded, or is not accepting additional connections.